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We consider the sum of the incurred start-up costs of a single unit in a Unit 
Commitment problem. Our major result is a correspondence between the facets 
of its epigraph and some binary trees for concave start-up cost functions CU, 
which is bijective if CU is strictly concave. We derive an exponential ^-rep¬ 
resentation of this epigraph, and provide an exact linear separation algorithm. 
These results significantly reduce the integrality gap of the Mixed Integer formu¬ 
lation of a Unit Commitment Problem compared to current literature. 


1 Introduction 


Start-up costs have been an integral part of the Unit Commitment problem since its first 
formulation Garver (1962), and gain further importance due to the increasing penetration of 
variable renewable energy. Electricity production from such sources, i.e. mainly from wind 
and solar, increases the variability of the net load met by thermal generators, resulting in a 
more frequent cycling and a higher number of start-ups (Kumar et al. (2012)). Hence, the 
percentage of start-up costs within the total production costs rises and an accurate modeling 
of the start-up costs becomes increasingly important. 

In contrast to the detailed incorporation of start-up costs in other methodologies (e. g. La- 
grangean Relaxation, Dynamic Programming, meta-heuristics), Mixed Integer Programming 
formulations were restricted to constant start-up costs (Garver (1962|) for a long time. The 


first model capable of representing arbitrary start-up cost functions was introduced in Arroyo 


and Conejo (2000). Since then, two major formulations have emerged: the model of Carrion 


and Arroyo (2006), based on Nowak and Romisch (2000), and the model of Simoglou et al. 


( 2010p . Their comparison in Morales-Espana et al. (20131 highlights the importance of a 
low integrality gap, demonstrating a significantly reduced solution time. This motivates a 
thorough polyhedral analysis. 

In a Unit Commitment Problem, the operational state of each unit i £J?is represented by 
a binary vector v,.£ {0, l} r , where for each period t € [T], vf = 1 iff the unit is online. As 
defined in (|7]), we denote by DCUf the summed start-up cost function, such that DCUf (v ; ) 
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equals the total start-up costs incurred by unit i in the course of v r Our major result is 
a correspondence between the facets of conv(epi(DCUf)) and binary trees of size T for 
concave start-up cost functions CU;, which is bijective if CU,- is strictly concave. We derive 
the exponential class of binary tree inequalities (BTIs), which together with the trivial 
inequalities 0 < v/ < 1, define an ^-representation of conv(epi(DCUf)), and give an exact 
linear separation algorithm for the BTIs. 

This result is important, both from the theoretical and practical point of view, since 

• the BTIs, and in particular the proof of Lemma [15} strengthen our understanding of the 
start-up costs, 

• the existing start-up cost models are extended formulations of conv(epi(DCUf)), and 
may therefore be analyzed and tightened using the BTIs, and 

• separated in a cutting plane approach, the BTIs reduce the integrality gap of the current 
state-of the art models (Carri on - and Arroyo] (2006); Simoglou et al. (20101) to match 
those of the the temperature model ( Sllbernagl et al.| (2014i). 


2 Preliminaries 


Every unit incurs costs on start-up, mainly due to the need for reheating and extra maintenance 
to mitigate the effects of thermal stress. These start-up costs depend on the length L of the 
preceding offline time, and are assumed to be given for each unit i as 


CU,-: 


L i->- cost after offline time of length L. 


( 1 ) 


CU,- is usually described as “exponential” ((Wood and Wollenberg|(jl996j)), and defined as 

W E *Z.«=»*,: .^>0. (2) 

10 else. 

with parameters Vj > 0. F, > 0, A,- > 0 (cf. Fig. [I]). 



In Silbemagl et al. (2014) we present how to exploit the exponential definition of the 
start-up costs. Here however, we only need CU,- to be non-negative, increasing, and concave, 
with CU,-(0) = 0. 

The modeled time range is discretized into T periods of possibly varying lengths L 1 ,..., L 7 
Furthermore, we assume the unit’s offline time prior to period 1 to be given as PDT, (pre¬ 
model offline fime). To simplify notation, we also discretize the start-up cost functions such 
that CUj(Z) denotes the costs of a start-up of unit i in period t, which was offline during 
the previous / periods. To this end, we define the function OL f (Z) as the offline time in that 
situation, that is 


VZG [T],l€ [0..Z-1]: 


OL'(0:=|Er L '"' 


if Z<f-1, 


L‘~J + PDT,- if / = f — 1. 


(3) 
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The above case distinction differentiates between the case where the offline time lies 
completely within the modeled time range and the case where it stretches to include the 
pre-model offline time PDT,. The cones ponding start-up costs CLJ'(/) are 

Vf £ [T], / £ [0 .. f—1] : CUj(Z) :=CU,-(OL r (f|). (4) 


As noted, the operational state of a unit is modeled by the variables v/ £ {0,1}, where 
v/ = 1 iff unit i is online in period t. Deriving the start-up costs in each period t from these 
variables yields the discrete start-up cost functions 


£ [r] : 


DCU' : 


{0, l} r —> M>o, 


fcuJK^)) 


where 


of (v,) := max{/ £ [t — 1] | v/ 1 = ... = v/ 1 = 0}. 


if vf = 1 
else, 


(5) 

( 6 ) 


In Fig. [2]this relationship is depicted for an exemplary operational schedule. Note that ol and 
OL follow the notational convention of / and L to denote numbers of periods in lower case 
and time lengths in upper case. 
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Fig. 2: Incurred start-up costs for an exemplary operational schedule (L 1 = ... = L 7 = I). 
The operational schedule v ( - £ {0, I} 7 over the course of T = 10 periods is shown in green. 
The first start-up takes place in period 2, with costs DCU 7 (v,-) of CU;(3) due to the preceding 
offline period with length L 1 = 1 and the pre-offline time PDT,- of 2. The only other start-up 
costs are incurred in period 9, after three offline periods with a total length of L 6 +L 7 + L 8 = 3. 


In virtually all published Unit Commitment formulations which consider start-up costs, 
the sum of the start-up costs is minimized, motivating the definition 

T 

Vi£J?: DCUf: {0,l} r ^M> 0 , v,- i-> £dCU 7 (v,-). (7) 

t =l 

Using DCUf, any Unit Commitment problem with start-up costs can be written as 

min| c(vj,...,, z) + £ DCUf (v,-) 

with additional variables z £ M", feasible set & C {0,1} r '\ M ,! , and objective function c. 
For example, in a typical Unit Commitment problem z may stand for electricity output and 
cost variables, and & may model production limits and costs, ramping speed limits, and 
minimal up- and downtime. Recognize, even in case the objective c is linear, due to DCUf 
the extended objective is non-linear. 


(v : 




z) £ & 


( 8 ) 
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Since the domain {0, I } 7 of DCUf is finite, the convex hull of its epigraph, 

conv(epi(DCUf)) = conv^{(v,-,cuf) | v ; € {0, l} r ,cuf > DCUf(v; 

= conv(V £ ) +pos(u 7 ’ + i), 


( 9 ) 


is a polyhedron with unbounded direction ut+ t (where ut .i denotes the unit vector in the 

direction of the last coordinate cuf), and with vertices 


= {( v pDCUf (v ; )) | v,- £ {0, l} 7 }- 


( 10 ) 


The domain {0, I } T of DCUf is extremal in its convex hull [0, l] 7 . Therefore, using the 
convex hull corresponds to extending DCUf to fractional vectors v t £ [0, l] r , i. e. to the 


unique function LCUf with 


epi(FCUf) = conv(epi(DCUf)). 


( 11 ) 


So, by introducing the variables cuf, the minimization problem ([8]) can be rewritten as 


mm 


c(v 


11 • 


^)+L cu ; 

iEJ? 


Vi £4^: (v f ,cuf) £ epi(LCUf), (v x .,V|^r|,z) £ (12) 


If c is affine linear and & is expressible in a Mixed Integer Program (MIP), then ( fl2| ) is a 
MIP as well, since the polyhedral epi(FCUf) may be modeled using its ^-representation in 
Q (see Dantzig and Wolfe ( |1960| ) for details). 

Due to the 2 T vertices of epi(LCUf) however, this would lead to an excessive model 


size. All existing approaches, including among others Nowak and Romisch (2000); Carrion 


and Arroyo ( |2006| ; [Simoglou et al. (2010), mitigate this by modeling sets which enclose 


epi(LCUf), in ^-representation. In this publication, we provide an exact ^-representation 
of epi(FCUf), accompanied by an exact separation algorithm with running time &{T). At 
its core, our contribution results from four observations: 

1. Iteratively lifting a trivial inequality variable by variable results in facets of epi(LCUf) 
and in coefficients which can be derived explicitly from the lifting order. 

2. Multiple lifting orders result in the same facet. Each lifted facet is already uniquely 
described by a special partial order, which can be expressed as a binary tree, warranting 
the name binary tree inequality (BTI) for the lifted inequalities. 

3. The binary tree associated to a lifted facet readily identifies the points on this facet. 
Furthermore, these facets describe the complete lower boundary (with respect to the last 
coordinate cuf) of the epigraph, proving that it does not have further non-trivial facets. 

4. For a fractional point, a suitable binary tree and the coefficients of the respective lifted 
facet can be determined efficiently. 

These observations are presented as follows: 

• in Section [3] the result of lifting a trivial inequality is analyzed, 

• in Section [4] special notation for binary trees is introduced, 

• in Section [5] the so-called binary tree inequalities are derived and shown to be facet- 
inducing, 

• in Section[6]these inequalities are shown to complete the ^-representation of the epigraph 
of the summed start-up costs, and finally 

• in Section [7] an exact separation algorithm is presented. 

Since all these sections consider the epigraph epi(FCUf) of a single unit, we drop the unit 
index i in the following. 
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3 Lifting Inequalities 


In this subsection, we observe that by lifting the trivial inequality cu £ > 0, we gain facet- 
inducing inequalities with explicitly derivable coefficients. 

The choice of cu E > 0 as a starting point is motivated by the properties of DC if, which im¬ 
ply that LCU 1 is homogeneous, i. e. that ALCU £ (v) = LCU £ (Av) for all A G [0,1], v G [0, l] 1 


(this follows explicitly from Silbernaglj ( tion) and implicitly from Corollary |T7] later). It holds 
for all ( x,y ) G [0, l] 7 X M, A G (0,1] that 


(v,cu £ ) G epi(LCU £ ) 


cu £ > LCU £ (v) <=^ Acu £ > ALCU £ (v) = LCU £ (Av) 
A(v,cu £ ) G epi(LCU £ ). 


This means, epi(LCU z ) is the intersection of a cone with [0, l] r X M, and thus all facets of 
epi(LCU £ ), except those induced by 0 < v 1 < 1, must also be facets of this cone. Hence, the 
facet-inducing inequalities are homogeneous, i. e. without constant term, and since cu z is not 
bounded from above in epi(LCU z ), of the kind 

cu 1 > £ a t v‘. (13) 

fe[r] 


Starting from cu £ > 0, such facets can be identified using the standard sequential lifting 
method (see e. g. Bertsimas and Weismantel (2005)). For each order a : [T] —> [T], this 
method determines the coefficients OC a (j) consecutively by considering the start-up costs for 
operational schedules v with coordinates a(j + 1 o(T) fixed to 0, 

V y G [0..T] : F(d):={vG{ 0,l} r VA G [j+l .. T] : = o}. 

Here, each J£A(ct) extends JF 7 ” 1 (a) by all v with v- = 1, 

V; G [T] : &l(o) := = {v G ^'(a),v ff(7) = 1}. 


Being fulfilled by 0 G epi(LCU £ ) with equality, cu 1 > 0 induces a 0-dimensional face of 
epi(LCU £ ). By determining the coefficients a Gi as 


f ■ 7 “ 1 

V./ G [T] : a aU) := min DCU £ (v) - £ a a{k) v 

l k= 1 

the sequential lifting method produces a series of inequalities defining /-faces of epifLCU^), 
culminating in a facet of epi(LCU £ ). 

We claim that the minimum in (14] ) is attained by the vectors v • ( a) G ^((o) with 

Vt G [T] : Vj (a) :=Vj_ x {p) + Uj, (15) 

using v 0 (a) := (0,... ,0). The coefficients of the lifted inequality would thus equate to 
Vj G [T] : a aU) = DCU £ (v ; -(a)) — DCU e (v ; -_ 1 (<t)) 

v v (lo) 

= DCU^y^a) + u aU) ) -DCU 5 ^ (a)). 

To prove this, a closer look at the change of the summed start-up costs that defines a a(; , 
is necessary. This change depends on v -_ 1 (o f ) and o(j), and not on the relative lifting order 
of ct(1), .... o{j — 1) and a(j + 1),..., <J(T). We express this by considering v, v G {0, 1} T 
and t G [7] with v = v + u t in place of v-_j(<7), Vj(o), and a(j). 
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As can be seen in Fig. [3j there are at most two indices t' £ [T] such that the start-up costs 
CU ( (v) and CU f (v) differ, 

1. the index t itself, and 

2. the minimal index q £ [f+1 .. T] with v q = 1, if such an index exists. 

These start-up costs depend on the number of offline periods immediately before and 
after period o(j). The number of offline periods preceding o( j) is given by ol CT W (y) (see 
Q). For each v £ {0, l} r , t £ [T], we define a symmetric function for the offline periods 
succeeding t as 

or'(v) := max{ j £ [0 .. T-t] | v' +1 = v t+2 = ... = v t+j = 0}. (17) 








t 
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Fig. 3: A step of the sequential lifting method, from the vector v to the vector v. By set¬ 
ting vj = 1, the offline time of length ol' (v) + or' (v) + 1 is split into two offline times of 
lengths ol' (v) and or f (v), thereby changing the summed start-up costs. 


So, for two operational schedules v,v£ {0, I } T which differ solely in period t. the start-up 
costs are unequal in periods t and t + or f (v). Abbreviating / := ol' (v) and r := or'(v), we 
obtain 


DCU £ (v) — DCU £ (v) 


DCU ( (v) + DCU f+r+1 (v) - DCU f+r+1 (v) if t + r < T , 
DCU ( (v) else. 


Thus, the change in the summed start-up costs depends only on the offline periods prior and 
after t, and can be further simplified. 


Proposition 1 For each v,v £ {0, l} r and t £ [T] with v = v + u t , it holds 


DCU e (v) - DCU e (v) = 8 r (oh (v), or f (v)), 

where 

8 f : [0 .. f —1] x [0 .. T—t] —)• M 

( CU f (/) +CU f+r+1 (r) — CU (+r+1 (Z + r + 1) ift + r<T, 
,r ^[CU f (/) else. 


The following lemma shows where the concavity of the start-up cost function CU is used. 
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Lemma 2 For each t € [T], S' is increasing in I and r if C U is concave , and strictly increasing 
if CU is strictly concave. 

Proof. Let t G [T], l, l G [0 .. t— 1] with 1 < Tand r, r € [0 .. T—t] with r < r be given. Denote 
the period indices 

ci := f + r+ 1 and Z/ := f + r + 1, 

as seen in Fig. [3} implying q < q. Recall that by definition, CU f (Z) = CU(OL'(/)), where 
OL' (/) denotes the offline length corresponding to / offline periods prior to period t (see ©)• 
In Fig. |3]for example, / corresponds to oL(v) and r corresponds to or' (v). So, OL q (l + r + 1) 
is the offline time prior to q in v, and OL'fir) and OL' (/) are the offline times prior to q and t 
in v, respectively. Hence 

OL q (l + r+ 1) = OL 9 (r+ 1) + OL f (Z). 

We start by proving that S' is increasing in /. For t + r = T, S' increases in / since CU' 
increases, 

S'(J,r) — S t (l,r) = CU f (Z) — CU f (Z) > 0. 


For t + r < T, we obtain 

S r (/» - S' (l, r) = C\J q (r) + CU f (l) - CU ? (Z + r + 1) 

- ^CU^(r) + CU'(Z) — C\J q (l + r + 1)^ 

= (CU*(/)-CU*(Z)) -(CU^f+r+lJ-CU^Z + r+l)) 

= (CU(OL f (/))-CU(OL f (/))) - (CU(OL 9 (/ + r+l))-CU(OL' ? (/ + r+l))) 

= (CU(OL r (/) + OL^'' 1 (/"- /)) - CU(OL f (/))) 

- (cU(OL«(/ + r+l)+OL r -'- 1 (f-/))-CU(OL 9 (/ + r+l))) 

which, when abbreviating x := OL f (/), y := OL^(Z + r+ 1), and 5 := OL' - ^ 1 (l — /), equals 

= (cU(jc + s)-CTJ(jc)) - (CU(y + s)-CU(y)). 

The non-negativity of the final term follows from the characterization of concave functions 
by subdifferentials, using that x < y and s > 0. 

The statement that S' is increasing in r follows analogously in the case of t + r<T. IF 
t + r = T, then 

8\l,r) - S\fr) = CU f (Z) - (CU'(/) + CU«(r) -C\J q (l+r+ 1)) 

= CU‘ ? (/ + r+l)-CU‘ ? (r) >0. 

Finally, in case of a strictly concave start-up cost function CU, all of the above inequalities 
are also strict. □ 

Using our claim regarding the lifted coefficients (see ©) may be restated as 

a<rCZ) = 5 ffW (ol aW (v 7 -),or ff W(v ; .)). 
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Based on Lemma[2] this could be proved by induction over j. Furthermore, Bertsimas and 


Weismantel 


(2005) show that, as cu E > 0 is valid for epi(LCU i: ), these inequalities define 
facets of cpifLCLr'). Since the current subsection is intended to be purely motivational, we 
prefer to re-introduce the lifted inequalities in Section [5] and prove that they induce facets 
without referring to the sequential lifting method. The necessary steps are essentially the 
same: 

• proving that the lifted inequalities are feasible requires the same arguments as showing 
a o{j) < 5 <T ^(ol ff ^(y / -),or°'^(y 7 -)), and 

• proving that these inequalities induce facets requires the same arguments as showing 

a aU) > 8^\ol^(vj), or<^(v/))- 


4 Notation for Binary Trees 

Describing the lifted inequalities of the last subsection necessitates non-standard notation 
for binary trees, which is presented in this section. A binary tree is defined as an undirected, 
rooted tree, where each node t has at most two child nodes: a left child llink(f) and a right 
child rlink(?) (cf. Fig. [5]). 

Basic notation includes: 

• Each binary tree B has a root denoted by root (71). 

• The path from a node t to the root is unique. The number d(t) of its edges is called the 
depth of t, and its nodes are the ancestors of t (including t itself). Vice versa, t is called a 
descendant of each of its ancestors. 

• The first ancestor, i. e. the node immediately succeeding t on the path to the root, is called 
the parent p{t) of t. Conversely, t is said to be a child of p(t). 

• The subtree S(t) comprising all descendants of a node t (including t) is the principal 
subtree in t. If the left/right child of t exists, then its principal subtree is the left sub¬ 
tree L(t)/right subtree R(t) of t. All of these subtrees are binary trees as well. 

• The number of nodes in these subtrees in t are denoted by s(t) := |S(f)|, l[t) := |L(f)| 
and r(t) := |/?(f)|. 

• The rank function is the unique mapping from the nodes V of a binary tree B to I..... | V 
such that 

. , . ... . , f < rank(p(f)) if t is the left child of pit), 

Vt £ VWroot (B } : rank(()< J 

| > rank(p(t)) if t is the right child of p(t), 

and such that the nodes of each principal subtree S(t) are a mapped to a contiguous 
interval. 

The rank function has a straight-forward interpretation: When drawing a binary tree such that 
left (right) childs are located below left (right) of their parents, the rank numbers the nodes 
from left to right (see Fig. [4]). Several basic properties can be derived from this observation: 

• For each binary tree B and (6 8, it holds that 

(rank(7) 1 1' £ 5(i)} = {rank(f') 1 1' £ L(t)} U {rank(t)} U {rank^') 1 1' £ R(t)}. 

• Since by definition the ranks of nodes in a subtree are contiguous, it holds that 

{rank(f') 1 1' £ L(t)} = [rank(t)— l{t) .. rank(f) — 1], 

{rank(f') 1 1' £ R(t)} = [rank(f) + l .. rank (t)+r(t)], and 
{rank(t') 1 1' £ S(t)} = [rank (t)—l(t) .. rank(t)+r(t)]. 


( 18 ) 










• Finally, since L(t) = S(llink(t)) and R(t) = 5(rlink(t)), we have 

rank(llink(?)) + r(llink(f)) + i = rank(t) = rank(rlink(t)) — /(rlink(f)) — 1. (19) 

For example, in the binary tree shown in Fig. |4]it holds that 

(rank(F) |F G S(t)} = [7..10] U{11}U {12} and 

{rank(f')|f'€i(0} rank(f') {rank(/')|^6t?(d} 


^ + s-i-/ +1 = ^ = —i- 

rank(llink(f)) r(llink(f)) rank(f) rank(rlink(;)) /(rlink(f)) 


root(B) 



/(F) =4 r(F) = 1 

s(t') = 6 


Fig. 4: A binary tree with nodes labeled by ranks from 1 to 12, and root(fi) = 6. The parent 
of node t = 3 is pit) = 5, its left child is llink(t) = 1 and its right child is rlink(f) = 4. The 
principal subtree 5(F) in F = II is marked in blue, its left subtree L(F) in green and its right 
subtree R(t') in red. The respective subtree sizes are .v(F), /(F) and r(F). 

The special nature of the coefficients of the facets of epi(LCU z ) is best characterized by 
some non-standard notation, the top-left and top-right nodes (see Fig. [5}. These nodes are 
defined recursively, with the root being the first top-left and top-light node. Left childs of 
top-left nodes are also top-left nodes, and right childs of top-right nodes are top-right nodes 
too. The last top-left node has rank 1, and the last top-right node has rank |F|. 

Definition 3 For each binary tree B on n nodes, define p 1 := t] ] := root(fi), 

V./ € [d( 1)] : M /+1 := llink(/i ; ), V; <E [d(n)] : B j+1 := rlink(rj ; ), 

where Pj denotes the j-th top-left node and i ] ; denotes the j-th top-right node. 

Conversely, ancestors of top-left (top-right) nodes must be top-left (top-right) nodes as well. 

We continue by proving basic facts regarding top-right nodes, and transfer them to top-left 
nodes by “mirroring” the binary trees. 
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Fig. 5: A binary tree with top-left nodes p ( and top-right nodes rj ( . Both p, and t] { always 
equal root (71). The last top-left node p d ^ +1 is always the node with rank 1, and the last 
top-right node ?? rf( r)+1 is always the node with maximal rank. 


The node n is contained in the principal subtree of each top-right node t] ■, and by 
definition possesses the maximal rank. Since the principal subtree of rj ■ spans the ranks 
[rank(t 7 / )— l(r\j) .. rank(r 7 J )+r(T 7 / )] (see (fT8|)), it must hold that rank(T 7 ,) +r(r] i ) = n. On 
the other hand, if the rank of a node t fulfills rank(t) + r(t) = n. then n lies in the right subtree 
of t. So, t is an ancestor of n, and thus a top-right node. 

Proposition 4 For each binary tree B on n nodes, a node t is a top-right node in B iff 
rank(t) +r(t) = n. 

This result may be extended to subtrees as well. 

Lemma 5 For each binary tree B on n nodes, a node t is a top-right node in the left subtree 
of a node t' iffmnklf) + r{t) + 1 = rank(T). 

Proof. By ( fT8| ), the nodes in the left subtree Lff) have ranks [rank(ri)— Iff) .. rank(f') — 1 ]. 
Thus, the rank function rank / ( ' J of L(t r ), which must keep the same order as rank and ranges 
from 1 to Iff) equates 

rank L(f,) (t) = rank(t) -rank(t') +lff) + \. 

Hence, each node t G Lff) is a top-right node in Lff ) iff rank i( ' \t) + r(t ) = \Lff)\ = Iff) 
(see Proposition [4]), which may be rewritten as 

rank(f) +r(t) + 1 = rank i(f,) (t) +rank(t') - iff) - 1 + r(t) + 1 = rank(t'). 

Assume now that there exists a node t of B with t ^ Lff) but rank(t)=rank(t')— r(t) — 1. 
Then rank(t) < rank(t'), and t f S(t'). On the other hand, rank(7) > rank(?) + r(t) and hence 
t'isf). 

Choose the first common ancestor t of t and t' , which due to t f Sff) and t' S(t) equals 
neither t nor t'. By choice of t it holds that t G L{t) and t' G R(t), and thus 

rank(t) + r(t) < rank(f) < rank(f / ) — iff), 

a contradiction to rank(f) + rff) + 1 = rank(t'). □ 

Note that since rank(t) + r(t) + 1 G [rank(t) + l .. n+ 1], t must be either a top-right node 
or a top-right node in the left subtree of some node t'. 
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Corollary 6 For each binary tree on n nodes, a node t is 

• a top-right node ///’ rank(r) + rit) = n, and 

• a top-right node in the left subtree of the node t' iff rank(t') = rank(f) + r(t) + 1 iff 
rank(t) + r(t) < n. 

Let B denote the “mirrored” version of a binary tree B of size n, i. e. the binary tree 
that results from exchanging the left and right childs of each node. The depth, parent and 
subtree size functions remain unchanged ( d B = d B , p B = p B , s B = s B ), the left and right 
subtree size functions are exchanged (r B = l B , 1 B = r 8 ), and the rank function is mirrored 
(rank B (t) = n + 1 — rank 5 ). By applying this mirToring to the properties of the top-right nodes, 
we can derive equivalent properties of the top-left nodes. 

Corollary 7 For each binary tree, a node t is 

• a top-left node /j^rank(t) = l(t) + 1 , and 

• a top-left node in the right subtree of the node 1 with rank(Z) = rank (7) — lit) — 1 iff 
rank(t) > l(t) + 1 . 

The root of a binary tree on n nodes is both a top-right and top-left node, and thus its rank 
equals rank(root(B)) = n — r(r) = l(r) + 1. 

5 The Binary Tree Inequalities 

In this subsection, we show that all lifted inequalities correspond in a one-to-one way to 
binary trees, which motivates naming them binary tree inequalities (BTIs). Together with 
0 < v 1 < 1, these inequalities induce all facets of epi(LCU 5: ). 

We start with an example where a 4 and a 9 are lifted in both possible orders, 

• a 4 before ag with intermediate vector Vj, and 

• a 9 before 0:4 with intermediate vector Vj (cf. Fig. | 6 |). 
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Fig. 6: Lifting coefficients a 4 and ag in both orders, with intermediate vectors. 
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If the coefficient a(, is already lifted, the relative order in which 0:4 and ag are lifted does 
not influence the period counts, 

ol \vj) = ol 4 (v i+1 ) and or 4 (v 7 ) = or 4 (v j+1 ), 

ol = ol 9 (v i+1 ) and ol 9 (v / ) = ol 9 (v ;+1 ). 

Thereby, the values of the lifted coefficients a 4 and ag are equal for both cases. This 
holds in general: As soon as a coefficient a t has been lifted, for each subsequently lifted 
coefficient a t > with t' <t we have or' (v) <t — t' . Thus, the period counts, and by extension 
the coefficient a t ', do not depend on the lifting order of coefficients a,- with t > t. Analogously 
if t' > t, the coefficient a ,i does not depend on the lifting order of coefficients a~ t with t < t. 
A lifting order a corresponds to a linear order on [T] with 

Vfi,f2 G [T] : t\ t2 cr 1 (fi) < cr -1 ^)- 


As argued above, the lifted coefficients a tl , a t2 do not depend on whether t\ tn or tj A <7 t\ 
if 


3 ?3 £ (I1T2) u (?1, ?2), t 3 ^ ah , 


Eliminating such relationships from yields a paitial order ^' a which fully determines 
the lifted inequality, i. e. each linearization of this partial order leads to the same lifted 
inequality. Fig. [7] shows an exemplary partial order (twice, as a Hasse-diagram) and two 
possible linearizations ^ and Act represented by the permutations a and a. 




Fig. 7 : A partial order determining the lifted coefficients (twice, as a Hasse-diagram) and 
two possible linearizations (as permutations a and a) leading to the same lifted inequality. 

As Fig. [7] suggests, the Hasse-diagrams of these partial orders are binary trees, where 
each coefficient a t corresponds to a node t. The partial order then prescribes that each 
coefficient a t must be lifted before the coefficients associated with the descendants of node t. 

Observe that for a lifting order a and the corresponding linear order the vectors v - 
encountered in the lifting process by definition ( fT5T ) fulfill 

v ' e pi-‘ e ": '2t aU) ' 
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Therefore, the offline lengths ol CT ^(v ; ) and orwhich define equate to 

o ,CT(,) (v/) = max{/ e [0 .. o(j)-l] \ \/t E [a(j)-l.. a(j)-l] : t <r(;)}, and 
or CT W(v 7 ) =max{/ e [0 .. T-a(j)\\\/t E [o(j )+1 .. c{j)+l} : t 4 a a(y)}. 


As argued, these lengths remain unchanged when replacing =4 a by the corresponding partial 
order Furthermore, their above representation shows that they can be derived from the 
Hasse-diagram of The size l(a(j)) of the left subtree of <j(j) equals ol ff ^(v-) and the 
size r(a(j)) of the right subtree of <j(j) equals or CT( ^(v ; ) (see Fig. | 8 1 . 



Fig. 8: The lifting step as shown in Fig. [ 3 ] with a possible lifting order represented by a 
binary tree B. The offline lengths ol CT ^(v ; ) and oi' fTi / - l (v' / ) adjacent to period o( j) are equal 
to the sizes of the left and right subtree of node o( j) in B. 


A coefficient a t therefore solely depends on the partial order A a and may be expressed as 


a, = 8'{l(t),r(t)), 


where t = <j(J), and l(t),r(t) are determined by the partial order. 

Each of the start-up cost terms in 8 f (l(t) ,r{t)) corresponds to the cost incurred when 
starting up after being offline during the periods contained in either the left subtree L(t) of t, 
the right subtree R(t ) of t, or the principal subtree S{t) of t (see (fT8])), 

8‘ r(t)) = CU f (/(0)+CU f+r W +1 (r(t))-CU f+r W +1 (/(t) + l + r(0) 

S -V- ^ ^-V-^ ^-V- y 

offline in L{t) offline in R(t) offline in S(t) 

To simplify the notation, we generalize the definition (see ([4])) of the offline length OL: 
for each B E and t E [T] , we abbreviate 

OL(S(t)):=OL f+r W +1 ( 5 (f)), (20) 

and consequently, by < [T8j ) 

OL(L(t)) = OL r (/(?)) and OL(R{t)) = OL r+r(f)+1 (r(f)). (21) 

Since by definition CU ? (/) = CU(OL'(/)), it follows that 

8* (l(t),r(t)) = CU(OL(L(t))) + CU(OL(7?(t))) — CU(OL(S , (t))). (22) 
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Definition 8 A rank-labeled binary tree is a binary tree B on nodes \n], where n G No, and 
rank(/) = i for all i G [n]. Let 33 denote the family of all rank-labeled binary trees on T], For 
each B G 3$, we define the binary tree inequality (BTI) as 

cu z > £ 8‘(l(t),r(t))v‘, 
te[T] 

using the sizes l(t) or r(t) of the left or right subtrees of t, respectively, and 8 1 as defined in 
Proposition^ 

In the following, we confirm that the binary tree inequalities, together with the trivial 
inequalities 0 < v’ < 1, define all non-trivial facets of epi(LCU i; ) by proving that 

• they are feasible (Lemma [T0|), 

• they are fulfilled with equality by all points (v i ,DCU i: (v J )) encountered during the lifting 
process (Lemma [IT]), and 

• these points are linearly independent (Theorem [T3]) . 

Moreover, we will show that all points not in epi(LCLr') can be separated by the BTIs or the 
trivial inequalities 0 < v' < 1 in ff(T). 

In the following, we need to put the vertices (v,DCU i: (v)) of the epigraph epi(LCU z ) 
into relation with the binary trees B G 33. 

Definition 9 For each B G 33 with edges E and v G {0, l} r , define B(v) as the subgraph of 
B induced by the 1-entries of v, 

B(v) := (y s ,E s ) where := {t G [T] \ V = 1} and E s := {e e E \eCV s }. 


Note that in general, B(v) is not connected. 

The proof that the binary tree inequalities are valid implicitly follows from the concavity 
of CU, which is exploited through the monotonicity of 5* (Lemma[2]). 

Lemma 10 For each B G 33, the BTI induced by B is valid for epi(LCU z ). 

Proof. Due to the convexity of epi(LCU i: ) and since a binary tree inequality bounds cu E 
only from below, it suffices to prove that all vertices (v,DCU E (v)) G V L of epi(LCU i: ) fulfill 
all BTIs. To do so, for each B G ^3, we prove that its induced BTI is valid by induction over 
the number of nodes n in its induced subtree B(v). 

For n = 0, the only such point is 0 G V E . Since each BTI is homogeneous, 0 fulfills all of 
them with equality. For n> 1, choose a leaf t in B(v), i. e. a node such that the subtree of t in 
B does not contain any other nodes from B(v). Define the vector v as 


v := 


if t't, 
if t' = t, 


differing from v only in period t. By Proposition [T] we get 

DCU e (v) = DCU^v) + <5'(ol f (v),or'(v)). 

These vectors, the induced subtrees, and the offline lengths oL(v), or f (v) are shown in Fig. [ 9 ] 
By the choice of t, its left subtree, which contains the nodes t — /(f),... ,t — 1, does not 
contain any nodes from B(v). Hence, = ... = v f_1 = 0, implying ol r (v) > l(t). Since 

the right subtree of t does not contain any nodes from B(v) either, we analogously obtain 
or f (v) > r(t). 
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v = (0 1 0 0 0 0 1 0 1 0 0) 


Fig. 9: Removing the leaf 1 = 7 from the induced subgraph B(v) results in the subgraph 
B(y) induced by v. The lengths ol 7 (v) and or 7 (v) denote the offline lengths before and after 
period 7 (see Q and (fl7|)). 


Thus, using the monotonicity of 8‘, it holds that 

DCU e (v) = DCU E (v) + 5'(ol'(v),or'(v)) >DCU E (v) + 5'(/(0,K0) 

ind.hyp. , . , , 

> I {lit')At 0)v f +5\l{t)At )) = £ 5' (/(!'), r{t')A . □ 

t'e[T] r'e[r] 

The central argument of the proof of the last lemma is that in each step of the induction, 
the inequality 

5'(ol f (v),or f (v)) > 8'(l{t),r{t)) 


holds due to ol'(v) > lit) and or'(v) > r(t). Assume that, for a given binary tree B and a 
vertex of epi(LCU £ ), this inequality is fulfilled with equality in each induction step. Then 
the vertex also fulfills the BTI induced by B with equality. We characterize such vertices in 
the next lemma. 


Lemma 11 For each B E SS, (v,DCU £ (v)) E if the induced subgraph B(v) is a tree 
containing root(B), then (v,DCU Z (v)) fulfills the BTI induced by B with equality. 


Proof. Analogously to the proof of Lemmaflo} we show that (v,DCU 2: (v)) fulfills the binary 
tree inequality induced by B E 38 by induction over the number of nodes n in the subgraph 
B(v). 

For n = 0, the only such vertex is 0 E V £ , which fulfills all BTIs with equality. For n > 1, 
let t be a leaf of B(v), and define v := v — it, as in the proof of Lemma 10 Note that since 
B(v) is a tree and root(B) E B(v), also the subgraph B(v) is a tree with n — 1 nodes, which 
contains root(B) in case of n > 2. 

Since no nodes in the principal subtree of t, except t itself, are in B(v). analogeously 
to the proof of Lemma [!()} it holds that ol f (v) > lit ) and or' (v ) > r{t). We show that these 
bounds are sharp by examining two cases for ol f (v) = /(f) (see Fig. 10 1 and two cases for 
or f (v) = r{t) (see Fig. lID- 


is 















a) t is a top-left node b) f is a top-left node in R(l) 

Fig. 10: Number of offline periods ol f (v) in v before period f, which is either bounded by the 
start of the model or by the first left ancestor l of node t 




a) f is a top-right node b) / is a top-right node in L(r) 

Fig. 11: Number of offline periods or r (v) in vector v after period f, which is either bounded 
by the end of the model or by the first right ancestor r of node / 


If f is a top-left node, then f=/ (f) +1 (see Corollary [7]), implying 1 (t ) <ol f (v) <t — 1 = 1 (f), 
and thus ol f (v) = /(f). Otherwise, there exists a node / such that t is a top-left node in the 
right subtree R(l) of I. Since I is an ancestor of f, we have 1 G B{v). Hence, 

/(f) <ol r (v) Cor °= ry ^(/~+/(f) + l) —f— 1 = /(f). 

If f is a top-right node, then f = r(t) — 1, implying r(f) < or f (v) < f + 1 = r(f). Else, f is 
a top-right node in the left subtree of a node r. Since f is an ancestor of f, we have r G B(v), 
and thus again 

r(t) <or f (v) <r — t— l Lei = ia ^-(f). 


In conclusion, we have shown ol f (v) = /(f) and or ( (v) = r{t). Analogous to Lemma 10 
DCU E (v)=DCU i: (v) + 5 f (ol f (v),or'(v))=DCU 5: (v) + 5 f (/(f),r(f)) 

E S'\l(,’), r (l’))i' J + S\l(t),r(l))= E «•'(/(<'), r(t'))v<’. □ 

t'e[r] f's[r] 
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In general, a vertex (v, DCU E (v)) may lie on a facet induced by a binary tree B which does 
not meet the requirements of the preceding lemma, e. g. for a linear start-up cost function. The 
next result shows that this does not happen for strictly concave start-up cost functions CU. 

Lemma 12 Let CU be strictly concave, B G 3d, and (v,DCU £ (v)) G Vf If (v, DCU s (v)) 
fulfills the BTI induced by B with equality, then either v = 0 or the induced subgraph B(v) is 
a tree containing root(fi). 

Proof. Again we prove the statement by induction over the number of nodes n in the subgraph 
B(v). The case n = 0 is fulfilled trivially. 

For n > 1, analogously to the proof of Lemma |1()| choose a node t such that the subtree 
of t in B does not contain any other nodes from B(v). This choice implies of (v) > / (t) and 
or'(v) > r(t), and thus, because of concavity, 

<T(ol f (v),or'(v)) > 8 f (l(t),r(t)). 

For v :=v — u t . Proposition [T] yields 

DCU £ (v) - <S f (of(v),or'(v)) = DCU z (v) 

which, since (v,DCU E (v)) G epi(LCU E ), may be bounded by 

> E «''(/(<'),r(('))v'' = £ r(,)) 

f'€[r] t'e[T] 

and, by the choice of v and t, equals 

= DCU £ (v) - 5 f (/(0,r(0) > DCU e (v) - <5'(of (v),or f (v)). 

Therefore, the above inequality is fulfilled with equality, and we conclude 

5'(of(v),or f (v)) = 8 f {l(t),r{t)) and DCU £ (v) = £ 8 1 '(l(t'),r(t'))v t '. 

t'm 

Firstly, as CU is assumed to be strictly concave, 8' is strictly increasing in / and r (see 
Proposition [TJ. Recalling that of (v) > /(?) and or'(v) > r(t) by the choice of t, we infer 
of (v) = l(t) and or'(v) = r(t). 

Secondly, since |fi(v)| = \B(v)\ — 1, the induction hypothesis states that either v = 0 or 
B(v) is a tree containing root(R). If v = 0, then 

s(t) = l(t) + 1 + r(f) = of (v) + 1 +or f (v) = t — 1 + 1 + (T — f) = T , 

hence t is the root of B and B(v) is the single-noded tree consisting of t = root(B). 

Else v f 0, implying that B(v) is a tree containing root( /?). Therefore t f root(fi), meaning 
that t has a parent p(t). By ( fl9l >, if t is a right child, then 

p(t) = t — l(t) — 1 = t — of (v) — 1, 


and if t is a left child, then 


p(t) = t + r(t) + 1 = t + or f (v) + 1. 

In both cases, v p ^ = 1 by definition of of (v) or or f (v) (§ and (fT7|)), and thus p(t ) G B(y). 
Given that B(v) is a tree containing root(R), we obtain that B(v) must be a tree containing 
root(B) too. □ 
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The vectors v encountered during a lifting process gain a non-zero entry in every step, 
and are thus linearly independent. 

Theorem 13 All binary tree inequalities induce facets o/epi (LCLm). 


Proof. For each binary tree B E SB, the induced BTI is valid for epi(LCU L ) (see Lemma 101. 

Choose a permutation a of [7] such that the nodes ait) are ordered by their depth d(o(t)) 
in B, i. e. such that for each t E [T — 1] it holds that d(a(t)) < d(a(t + 1)). For each j E [ T ], 
define the vector 

'l if a-\t)<j, 


W E [T] : v := 


0 else, 


which induces the subgraph B(v-) containing the nodes (tj(l),..., a(j)}. B(v-) fulfills the 
requirements of Lemma [TTj and thus the vertex (vy, DCU E (v / -)) fulfills the BTI with equality. 
Denoting the permutation matrix associated to a with I l a , we obtain 


n ff • (vj v 2 



i\ 

i 


\o ... 0 1/ 

and thus 0, (v 1 ,DCU i: (v 1 )), ..., ( v T . DCLm( v T )) are affine linearly independent. Since they 
amount to T +1 points on the face F of epi(LCU i: ) induced by the BTI, F must be a facet.D 


6 Sufficiency of the BTIs 

In this subsection, we prove that the binary tree inequalities (Definition [8]), together with the 
trivial facets 0 < v‘ < 1, are sufficient for an ^-representation of epi(LCU i: ). To do so we 
show that the facets induced by all the BTIs fully describe the lower boundary of cpi(LCU“). 
To this end, we extend the characterization of the facets on which a vertex (v,DCU £ (v)) lies 
(Lemma [IT]) to points with v E [0, l] r . 

Lemma m provides the sufficient (but not necessary) condition for the vertices of 
epi(LCU z ) lying on a certain facet: “A vertex (v,DCU i: (v)) lies on the facet corresponding 
to a binary tree B E 38, if the induced subgraph B(v) is a tree containing root(B).” In other 
words, each node t E B(v) needs to be connected to root(B) within B(v). The unique path P, 
from t to root(B) consists of its ancestors, 

Pt = t -> p{t) -> p{p(t )) -)• p{p{p(t))) root(B). 


By definition, t E B{v) v 1 = 1. So, B(v) contains the paths P t for all t E B(v) iff 
V = 1 ^ v p{t) = 1 for all te[T}\ {root(B)}, 

which, since v E {0,1} T , is equivalent to 

v‘ < v p{ - t] for all te[T}\ {root(fi)}. (23) 

Coincidentally, this condition is also important when searching efficiently in a point set in 
the Cartesian plane, and is denoted by “B is a Cartesian tree for v” in Vuillemin (1980 1 . We 
use a definition adapted to our puiposes, equivalent to the recursive definition of Cartesian 
trees in |Gabow et al.| ( [T984 l. 
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Definition 14 For each v £ M r , a binary tree with 

v ? < v pW for all f € [r] \ {root(fi)} 


is called a Cartesian tree for v. 


The following lemma shows that condition (23) applies to any v 6 [0,1] as well. 


Lemma 15 Ifv £ [0, l] r and B 6 'dS is a Cartesian tree for v, then (v,LCU i; (v)) lies on the 
facet o/epi(LCU i: ) induced by B. 


Proof. For each v £ [0, l] r and each Cartesian tree B £ dS for v, we decompose the vector v 
into binary vectors Vj, which, paired with the summed start-up costs DCU £ (vy), lie on the 


facet of epi(LCU E ) induced by B. This decomposition is depicted schematically in Fig. 
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convex combination: factors and vectors 



Fig. 12: Decomposition of a fractional vector v £ [0, l] 5 into binary vectors v ; - £ {0,1 } 5 , such 
that all points (v-,LCU E (v-)) lie on the same facet. 


Choose a permutation a which orders the indices t £ [T] by decreasing value of v f , and 
increasing depth d(t) in B if the values v 1 arc equal, i. e. such that 

Vjiji G [T],j\ < h : v a(]l) > v a(n) and 

x Mh) = v a(j2) ^ d(a(ji)) < d(a(j 2 )). 

For all / £ [T], define the vectors i;• £ {0,1 } T and the coefficients Ay as 

, fl ifa -1 (t)<y, f v ° P (^) — v CT 0'+i) for j<T-l, 

J 1^0 else, else. 

By definition of a, all A j are non-negative, and obviously they sum up to v^ 1 ’. Hence, for 
each t £ [T], it holds that 

£A;vj= £ A j= £ (v tT W_ v ^a+ 1 )) +v a(r) =v <T(a~ 1 (r)) =v r_ 

J = 1 j=0~ l {*) j=G~ l (,t) 
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Therefore, the vector v indeed is a convex combination of the vectors v ■ and 0, 

V = £ A/Vy + (1 -v ffll) )0. 

7=1 

We proceed by showing that the vertices (v-,DCU E (v-)) lie on the facet induced by 
B. For each j E [T], the induced subgraph fi(v-) is the induced subgraph of B on nodes 
{t E [J] | a -1 (t) < jj. Consider an arbitrary node t E 5(v-) and its parent pit). Due to the 
definition of B. both 

v ? < and d(t) > d(p(t)) hold. 


So, by choice of the permutation a, we have 


and hence p(t) E 5(v -) too. As noted in the motivation preceding this lemma, this condition 
is equivalent to “fi(vy) contains the root of B and is a tree”, and thus LemmaJTTj implies that 
(\’j, DCU^Vy)) lies on the facet induced by B. 

Being a convex combination of such points, the point (v,£j =1 A/DCU'fvy)) lies on the 
same facet, implying LCU z (v) = Y.j=i A/DCU E (v-) and therefore that (v,LCU z (v)) lies on 
the facet of epi(LCU z ) induced by B. □ 


Note that Lemma 


15 


is a 


generalization of Lemma [TT| since for discrete vectors v E {0,1 } r 
the induced subgraph B(v) is a Cartesian tree for v iff it is empty or a tree containing root(71). 

By the last result, each BTI defines a part of the lower boundary of epi(LCU z ), and 
thereby also a part of LCU £ . 


Corollary 16 IfB E -7# is a Cartesian tree for v E [0. 1 ] T , then 


LCU e (v) = £ <5 f (/(0,r(t))v f . 

te[T] 


By definition, a Cartesian tree for some v is also a Cartesian tree for Av with A > 0, which 
means Corollary [l6] states that LCU 5 " is homogeneous. 

Corollary 1 7 LCU 1 " is homogeneous, i. e. 

VA e [0, l],v E [0, l] T : ALCU^v) = LCU £ (Av). 


Since epigraphs are characterized by their lower boundary, this is equivalent to: 

Corollary 18 If (v , cu z ) E [0, l] 7 " x M and B E 38 is a Cartesian tree for v, then (v,cu E ) lies 
in epi(LCU E ) iff it fulfills the BTI induced by B. 

Proof. (v,cu z ) E epi(LCU z ) <t=>- cu £ > LCU i: (v) = ^ 5 r (Z(f),r(f))v ( 

f 6 [r] 

<E=> (v,cu £ ) fulfills the BTI induced by B. □ 


Finally, since there exists a rank-labeled Cartesian tree for each v E [0,1 ] 7 (Gabow et al. 


(1984)), the BTIs and the trivial inequalities 0 < v f < 1 completely describe cpifLCU 2 '). 
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Theorem 19 


epi(LCU £ ) 


' (v,cu z ) G R T+l : 

cu z > Y 5'(/(0>K0)v f , 

tm 

0<v f <l, 


fiGJ 

*eP1 


(24) 


It remains to discuss whether this ^-representation is irredundant. As noted in the 
paragraph before Lemma [T2| if the start-up cost function CU is not strictly concave, e. g. if 
CU is linear, then a vertex (v,DCU z (v)) may fulfill a BTI induced by a binary tree B G 38 
which is not a Cartesian tree for v. This leads to multiple binary trees inducing the same facet 
of epi(LCU i: ), rendering the above ^-representation redundant. 

describes all vertices (v,DCU z (v)) on a 


However, if CU is strictly concave, Lemma 
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facet. By showing that different binary trees induce facets with different vertices, we prove 
that the given ^-representation is irredundant. 


Theorem 20 If CU is strictly concave, then the Jrf?-representation ( f24| ) o/epi(LCU i: ) is 
irredundant. 


Proof. For each B\,Bi G dS with B\ / Bi, we construct a vertex (v,DCU E (v)) G V z such 
that B\ is a Cartesian tree for v, but /L is not a Cartesian tree for v. Then, by Lemma 11 
(v,DCU i: (v)) lies on the facet induced by B\, and by Lemma 12 (v,DCU E (v)) does not lie 


on the facet induced by Bo, proving that the induced facets arc not equal. 

If root(/i|) f root(TL), then (M root ( Bl ),DCU z (M root ( Sl) )) clearly is such a vertex. Other¬ 
wise, we have r := root (B\ ) = rootf/L j, and therefore the edge sets E\ of B\ and Eo of Bo 
differ. Each edge in B\ connects a node t to its parent p Bl (t). Choose e = {t,p Bi (t) } G E\ \Eo 
with minimal d ( t). Since e (j £2 and t f r, t has different parents p\'.= p Bl (t) and po := p B - (t) 
in B\ and Bo. Now, denote the path from r to p\ in B\ by P, and define v G {0, I } T as 


Vf' G [T] : 


1 if t' = t or t' G P, 
0 else. 


By definition, B 1 is a Cartesian tree for v. 

We conclude this proof by showing that p 2 (jo P, which implies v Pl = 0, and thereby that 
Bo is not a Cartesian tree for v: Assume P 2 G P. Then, since choosing e such that d(t) is 
minimal, we have P C Bo, and therefore p\ is a descendant of po in both B\ and Bo. Since t 
is a child of P 2 in Bo and t <j P,\{ holds that t and p\ are separated to the subtrees L Bl (p 2 ) 
and R B2 (P 2 ) of P 2 , and thus either t < P 2 < pi or p\ < P 2 < t. However, t is a child of pi in 
B\, and thus both, p\ and t, either lie in the left subtree L B] ( P 2 ) or right subtree R 11 ' (p 2 j of 
P 2 in B\. Since this implies either t ,p\ < po_ or /;? <t.p\, we obtain a contradiction, proving 
po_fP. □ 


By Rosen (1999), the number of different binary trees on T nodes is the T -th Catalan 
number, which allows us to count the number of facets of epi(LCU i: ). 

Corollary 21 If CU is strictly concave, the number of facets o/epi(LCU 5 ') is 


Cj + 2T ~ 


4 t 

_ 3 ,— ? 

Ti^n 


where Cj denotes the T-th Catalan number. 


21 










7 Separation 


Generally, the ^-representation of epi(LCU z ) given in the last subsection is of exponential 
size, and thus not (directly) suitable for computational purposes. This is overcome by a 
cutting plane approach based on an exact separation algorithm for epi(LCU^) presented in 
this section. Assuming that the start-up cost function CU can be evaluated in ^(1), as is the 
case for the exponential start-up cost function 0 . we show that this separation algorithm has 
a running time of Otherwise, the running time would change proportionally, 

states that a point (v,cu £ ) 


Lemma 


15 


G [0, l] r xM lies in epi(LCLr 


iff the BTI induced 

by the Cartesian tree for v is fulfilled. Thus, the idea of the separation algorithm for epi(LCU z ) 
is to find a Cartesian tree for the vector v, and construct its induced BTI. 


A linear-time algorithm for finding a Cartesian tree for v has already been given in Gabow 


et al. (19841, and is denoted as FindCartesianTree. In summary, this algorithm starts with 


a tree with a single node 1, and iteratively adds the remaining nodes f G [2 ..T], The key 
observation is that the node f in each iteration must be added such that it results to be 


• the last top-right node (as to receive the correct rank f), and to be 

• beneath all top-right nodes t' with v f > v' and above all top-right nodes t' with v*' <v‘. 
The algorithm represents the resulting Cartesian tree by the left and right childs llink(f) 

and rlink(f) of each node f G [T]. To construct the induced BTI, its coefficients 


a t = 8‘(l(t),r(t)) = CU(OL(L(f)))+CU(OL(rt(f)))-CU(OL(5(f))) (25) 


need to be computed. This requires the subtree sizes /(f), r(t) and s(t), which due to ( p~8] ) and 
( fl9l ) equal 


/(f) = s(llink(f)) = /(llink(f)) + 1 + r(llink(f)) = /(llink(f)) +f — llink(f), (26) 

r(f) = s(rlink(f)) = /(rlink(f)) + 1 + r(rlink(f)) = rlink(f) — f + r(rlink(f)), (27) 

s(t) = l(t) + 1 + r(f). (28) 

By the definition of the offline lengths OL'(/) (see Q), we have 

VfG [T],h € [0..f —1],/ 2 G [f-/i —1] : OV(h +1 2 ) = OV{h) + OV~ l1 (l 2 ), 


and the desired offline lengths OL(L(f)), OL(R(tj) and OL(.S’(f)) may be derived from 
([20]),([2T]) for each t G [T] as 


OL(L(f)) 

OL(R(t)) 
OL (5(f)) 


OL f (/(f)) = OL f (f-l) 


OL *-'(*)(t-l(t)-l) if /(f) < f — 1, 

0 else. 


OL r+r(0+l (,.(,)) = OL'+'-W+t + r ( f )) _ 0 L f+1 (f), 
0L r+r(r)+l( 5 (f)) = OL(L(f)) +OL{R(t)) +V. 


(29) 

(30) 

(31) 


Using (|25T)-([3T|), the coefficients a t of the induced BTI can be computed in linear time. 
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Proposition 22 A Igorithm [ 7 ] solves the separation problem for cpi (LC U E ) in A/(T). 


Algorithm 1: SeparateBTI 

Input : Point (v,cu E ) E [0, l] T X M 

Output: (v,cu E ) E epi(LCU E ), or a separating inequality. 


B •(— FindCartesianTree(v); 


for t = 1 ,... 

,T do 


m := < 

/ (llink(r)) + f — llink(t) 

0 

if llink(f) /0 
else; 

for t = T ,.. 

, 1 do 


KO := ' 

rlink(f) — t + r(rlink(t)) 

if rlink(f) / 0 
else; 

_ 5 (0 : =' 

(t) + l(t) +1; 



7 01^(0) := PDT; 

8 for t = 2,... ,T do 

9 |_ OL'(f-l) :=OL f_1 (f-2)+L f_1 ; 


to for t = l,... ,T do 


it 

12 

13 


OL(L(f)) := OL'(t — 1) 


0 


OL (R{t)) := OL ?+r W +1 (t + r(t)) - OL r+1 (f); 


OL(S(f)) := OL(L(t)) +V+OL (R(t)); 


1 ) 


if l(t) < t — 1, 
else. 


14 for t E [r] do 

f CU(OL(L(t))) +CU(OL(/J(t))) -CU(OL(S(f))) 


15 


a, := 


[CU(OL (L(f))) 


16 if cu E > ^ then 

»e[r] 

17 return (v,cu E ) Eepi(LCU E ); 


if t + r(t) < T, 
else. 


is else 

19 return (v,cu E ) may be separated from epi(LCU E ) by the BTI with coefficients 


23 









8 Experiments 


This section mainly compares the integrality gap of a Unit Commitment problem which 
models epi(LCUf) using binary tree inequalities (577) to a Unit Commitment problem using 


the start-up cost formulations presented in Carrion and Arroyo (2006) (1-Bin), Simoglou 


et al. (2010 1 (3-Bin), and Silbernagl et al. ( |2014 i ( 1-Bin * and Temp). 


8.1 Scenario 


We model the German electricity system in 2025, using the same scenario as Silbernagl et al. 
(2014). For each unit i e ■/, we consider the following parameters: 


the variable and fixed production cost coefficients A,- and Bj, 
the minimal and maximal production P, and P„ 

the maximal upward ramping speed while online (RU;) and at start-up (SU,), and 
the maximal downward ramping speed while online (RD,) and at shutdown (SD,). 

The values of these parameters are based on the the German power system ( [German Fed- 
eral Network Agency (20141) and complemented by assumptions on minimal production, 
efficiency, and start-up costs, which are partly derived from Kumar et al. ([2012]), EUREUEC 


TRIC ( [2003 ), and Egerer et al. (2014). As is expected for 2025, all nuclear units are replaced 
by combined cycle plants, resulting in a total of 223 thermal units. 

The demand data is taken from EnTSO-E] (20071 and scaled to a yearly consumption 
of 520 TWh. As wind, solar, biomass, and hydro plants are not modeled explicitly, their 
expected production is subtracted from this demand, yielding the residual demand d'. The 


production from wind and solar plants is computed based on Rienecker et al. (2011) in 
combination with the expected installed capacity in 2025 (50 GW wind capacity, 50 GW 
solar capacity). Biomass and hydro plants are assumed to produce at full capacity (5.5 GW 
biomass capacity, 4.5 GW hydro capacity). 


8.2 Base Unit Commitment Formulation 

As the focus of the experiments lies on comparing start-up cost formulations, we restrict 
ourselves to a basic model for the Unit Commitment problem. Its objective is to fulfill the 
residual demand d‘ in each period t ( [33] ), while minimizing production costs cp] and start-up 
costs cuf ( |32[ ). The production costs cp- of unit i in period / are modeled as a linear function 
of its electricity output p\ and its operational state vf in ( |34j ). The production limits in (35) 
and the ramping limits in (|36|)-(|38|) are modeled as in Carrion and Arroyo](2006 1 . 


min £ £ cp] 

+ £cuf 


(32) 

iEJ? t£[T] 




Y.ieJ'ri = d ' 


Vt g [r], 

(33) 

cp] =A i p , i +B i v t i 


Viej?,t e [T] 

(34) 

P«v/ < p\ < P/v/ 


Vf g y,t e [T ], 

(35) 

p]<p]-'+RU ( v/ 

“ 1 +SU;(v/ — v/ -1 ) +P,-(1 — v/) 

ViG/,fG[2..r], 

(36) 

p\ > p‘r' - rd,v/ 

— SD,(v/ _1 — vf) +P;(1 — v/ -1 ) 

VieJ r ,te[2..T], 

(37) 

p\ < Pjv/ +1 + SD,-(v/ — v/ +1 ) 

Vi£J?,te[T- 1], 

(38) 

v/ € {0, l},p],cp] 

et>o 

Vi€ G [T]. 

(39) 
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8.3 Start-up Cost Formulations 


The term cuf in (32) represents the summed start-up costs of unit i according to the thermal 


start-up cost function in Q, 

Vi £J,L £ 


CU i(L) = 


'Vi(\-e KL )+Fi if L > 0, 

0 else. 


In <(4j). this function is discretized such that CU-(Z) corresponds to the start-up cost incurred 
by unit i in period t after an offline time of / periods. While the formulations 1-Bin, 1-Bin* 
and 3-Bin generally model a piece-wise constant approximation of these start-up costs, we 
use the unapproximated values CU-(/) for comparability with Temp. 

Each of the start-up cost formulations models cuf differently: 

• In 1-Bin (Carrion and Arroyo (20061), the start-up costs cu! in period t are modeled as 


Ct4>cu K0(v/ 

Mi] 


Vi£j r ,t £ [T],l £ [0 .. t- 1], 


(40) 


and the summed start-up cost of unit i is cuf := Y.te\T cu f 

Similarly, 1-Bin* (|Silbernagl et al. (2014)) models cuf := Y*te[T] cuf with 


t-j 


Vi£j?,te [T],le [0 (41) 


cu! > CUj(/)v/ - £ CU'(/) - CU-(j-1 

MV 

3-Bin (Simoglou et al. (20100 uses the start-up and shutdown indicators (Garver ( 1962| >), 

V if PDT, > 0, 

(42) 


sui/ — sdi/ = 


vf — 1 else, 
sui( — sdi! = vf — v/ _1 
sui!,sdi! >0 


ViG/,iG[2.T], 
Vis/,i e [T]. 


In the special case of unapproximated start-up costs, this formulation simplifies to 


sui!= £ 5/(/) 

Mt-i] 

m< sdij-' 

m >o 

euf:= £ I CUi(05/(0 

te[T}l€[t~i\ 


v / e t e [r], 

V/E J,t £ [T],l £ [t- 2], 
ViS/,t6 [T],l£ [f-1], 
V / e J. 


(43) 

(44) 

(45) 

(46) 

(47) 

(48) 


Temp (Silbemagl et al. (2014)) also uses the start-up and shutdown indicators in (|42])-(|44]), 
and models the start-up costs as 


temp, 1 = e ' t,PDT ' + hff 
temp! +1 = e _ ^ iL temp! + (1 - e _Ail/ )v/ + h\ 
vf < temp! < 1 

temp! > vf + e - A ' OL '( f - 1) (l - vf) 
temp! > v/ + e“ A ' OLf(/ ((temp! _/ - v/) 
h\ > 0 

cuf :=Fj Y sui! + U ( Yj h ‘i 
te[T] fe[o.. r-t] 


V/GJf (49) 

s li£j,t £ [r-1], (50) 

Vi£j r ,t£[T\, (51) 

\/i£y,t£[T], (52) 

Vi£j r ,t£ [T],l£ [t-2], (53) 

Vi£j,t £ [0 .. r-1], (54) 

Mi£j. (55) 


Finally BTI denotes the ^-representation of epi(LCUf) in Theorem 


19 


25 





























8.4 Integrality Gap 


Fig. [13] shows the integrality gaps of all formulations for 14 test cases with 72 one-hourly 
periods, i. e. T = 72 and L 1 = ... = L r = 1 hour. The modeled time ranges are spread 
uniformly over the year 2025, starting in its 5-th hour with 5 G {433 + 624 j \ j G [0.. 13]}. 
The values of 5 are chosen such that each time range starts at midnight, and exactly two time 
ranges_start on each day of the week. 


Silbernagl (tion) shows that Temp is an extended formulation of epi(LCUf), reducing 
the number of constraints by introducing the additional variables temp], /;' 1 and sui] for all 
i G t G [T]. Hence its integrality gap is equal to that of BTI, up to numerical inaccuracies. 
Both lead to the smallest integrality gap, which is expected since all start-up cost formulations 
model a set which encloses epi(LCUf); The integrality gaps of 3-Bin, 1-Bin* and 1-Bin 
are significantly larger on average (Fig. [13} left chart). By expressing their integrality gaps 
relative to BTI, we see that BTI and Temp dominate the other models. (Fig. [13} right chart). 


7% 


5% 


3% 


1 % 

0 % 



1-Bin 1-Bin* 3-Bin Temp BTI 1-Bin 1-Bin* 3-Bin Temp BTI 

(a) Integrality gap (b) Integrality gap relative to BTI 


Fig. 13: The integrality gap of the start-up cost formulation for 14 test cases, absolute (left 
chart) and relative to BTI (right chart). BTI and Temp provide equal integrality gaps, up to 
numerical inaccuracies, and dominate the other models. 


Despite these results, augmenting 1-Bin, 1-Bin* and 3-Bin with binary tree inequalities 
as cutting planes does not improve the solution times considerably. While 1-Bin was able to 
solve more test cases within a time limit of 1 hour when paired with BTIs, no advantage was 
determined for 3-Bin. The cutting plane algorithm of the BTI exhibits a very slow convergence 
of the lower bound and numerical problems, leading to long computation times for each 
reoptimization step in the IBM ILOG CPLEX and the FICO Xpress Optimizers. Further 
research on the selection and implementation of the BTIs might overcome the numerical 
problems encountered. 

Temp however does not suffer from the numerical problems and the slow convergence 
of the BTIs, and, as substantiated experimentally in Silbernagl et al. ( j2014j ), outperforms 
the start-up cost formulations in current literature. Still, the description of epi(LCUf) we 
provide in this paper is the theoretical basis for Temp, and in particular justifies the separation 
algorithm for inequalities d52b,([53 1 . 
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9 Conclusion 


We were able to derive a complete ^-representation of the epigraph of the summed start-up 
costs. The resulting Unit Commitment formulation significantly outperforms two state-of- 
the-art formulations in terms of the integrality gap. While applying the BTIs in a cutting 
plane algorithm does not improve the solution times, they provide the theoretical basis for an 
efficient extended formulation. 

In future work one may also take into account the interdependencies of multiple units 
for the development of further inequalities. Consider for example a demand-driven Unit 
Commitment problem: during a shaip increase in demand, some number of units may be 
forced to staid up, resulting in a bound on the total start-up costs of all units, which has to be 
modeled by multi-unit inequalities not considered in any Unit Commitment model so far. 
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